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A SCALABLE CLOCK DISTRIBUTION FOR MULTIPLE CRU 

ON THE SAME CHIP 

5 

BACKGROUND OF THE INVENTION 

This invention relates generally to digital data 
communications systems, and more particularly to the recovery of 
clock and data signals from serial digital data streams. 

10 In many communications systems, digital data is often 

transmitted without a clock signal that would otherwise indicate 
the sequence of data bits. This increases available bandwidth 
over a transmission medium through avoidance of transmission of 
an attendant clock signal, A non-return to zero (NRZ) data format 

15 is an example of a transmission scheme that does not require that 
a clock be transmitted along with the data. 

The data signals, in the form of bits, are transmitted 
approximate a predetermined transmission rate. The transmission 
rate is determined by a clock in a data transmitter. Initially, 

2 0 it might seem possible to simply equip a data receiver with a 

clock operating at the same frequency as the clock in the data 
transmitter. Thus, data would automatically be received at the 
same rate that it was transmitted. However, even very small 
variations in the frequency of either of the two clocks would 

2 5 likely render this scheme unusable. A number of factors can lead 

to clock frequency variation, such as the age of the clocks and 
associated components, manufacturing tolerances, and 
environmental conditions such as temperature. In addition, if 
the clocks differ in phase by any substantial amount, data 

3 0 sampling could occur during data transitions, with resulting 

errors . 

Since the data signals are transmitted approximate a 
predetermined rate, it is possible to extract timing information 
from the data stream itself. Using the data signal to extract 
35 timing information can be accomplished using phase locked loop 
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approaches. The extracted timing information is used to recover 
the sequence of the bits in an incoming serial data transmission. 

5 As demand for high speed communications continues to grow, 

multiple data streams are often used to transmit information. In 
order to simultaneously recover the information contained in each 
data stream, the timing information needs to be extracted from 
each data stream. This generally requires multiple phase locked 

10 loops, as at least one is ordinarily used for each data stream 
with the phased locked loops including clock generators such as 
oscillators, phase detectors, filters, etc.. For communication 
systems having multiple data streams received by a single chip, 
the number of such elements that can be effectively placed on a 

15 single chip is constrained by area constraints, the package 
technology used, and by the thermal requirements of the system. 

BRIEF SUMMARY OF THE INVENTION 

The present invention provides for a scalable clock 

20 recovery system able to be implemented on a single chip. In one 
embodiment the invention comprises a clock master unit (CMU) , a 
plurality of clock recovery units (CRU) , and a clock distribution 
network. The CMU generates master clock signals, providing them 
to the clock distribution network. In one embodiment the master 

25 clock signals comprise a truth clock signal and a quadrature 
clock signal. The CRUs receive the master clock signals from the 
clock distribution network. The CRUs use the master clock signals 
to generate a recovered clock signal. In one embodiment, multiple 
levels of mixing are used to generate the recovered clock signal. 

3 0 The recovered clock signal can be used to recover data from an 
incoming serial data stream. 

A further embodiment of the present invention comprises a 
clock generation system. The clock generation system includes 
means for generating multiple out of phase clock signals. The 

35 clock signals are approximate a predetermined frequency. An 
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embodiment of the invention also comprises means for producing 
a recovered clock signal. The means for producing a recovered 
5 clock signal uses at least two clock signals generated by the 
means for generating multiple out-of -phase clock signals. The 
recovered clock signal has a selectively variable phase and a 
frequency approximate the frequency of the multiple out-of -phase 
clock signals. 

10 These and other aspects of the present invention will be 

more fully appreciated in view of the attached figures briefly 
described and in the following detailed description. 

DESCRIPTION OF THE DRAWINGS 
3_5 FIG. 1 is a top-level block diagram of an implementation 

incorporating multiple CRUs on a single chip in accordance with 
the present invention. 

FIG. 2 illustrates a block diagram of an embodiment in 
accordance with aspects of the present invention having a clock 

2 0 master unit distributing two clock signals to multiple CRUs on 

the same chip. 

FIG. 3 illustrates a block diagram of an embodiment of a 
CRU used in the the system of FIG. 2. 

FIG. 4 illustrates a semi-schematic block diagram of an 
25 embodiment of a mixer used in the CRU of FIG. 3. 

FIG. 5 is a timing diagram of truth and quadrature clocks 
generated by the CMU of FIG. 1. 

3 0 DETAILED DESCRIPTION 

A top-level block diagram of an embodiment of the present 
invention can be seen in Fig. 1. In Fig. 1, multiple data streams 
11 are received from a transmission medium (not shown) by a chip 
13. A clock master unit (CMU) 14 located on the chip receives a 
35 reference clock signal 15 from a voltage controlled oscillator 
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(VCO) (not shown) . The reference clock signal is approximate an 
expected transmission frequency of the serial data signals 

5 provided by the multiple data streams. 

The CMU uses the reference clock signal to form master clock 
signals. The CMU provides the master clock signals to a clock 
distribution network 17. The clock distribution network couples 
the CMU to a plurality of clock recovery units (CRUs) 19a-d that 

10 are on the same chip as the CMU. Each of the CRUs uses the master 
clock signals and the incoming serial data stream to construct 
a regenerated clock signal. The regenerated clock signals are 
used to recover data from the incoming serial data stream. 

As can be seen from Fig. 2, a CMU 21 receives a reference 

15 clock signal 23 from a VCO (not shown) . The CMU generates two 
master clock signals 25a, b from the reference clock signal. In 
a preferred embodiment, the two master clock signals are a truth 
clock signal and a quadrature clock signal. The truth clock 
signal and the quadrature clock signal are at the same frequency, 

20 but have a constant phase difference of 90 degrees. 

Fig. 5 is a timing diagram of a truth clock signal 61 and 
a quadrature clock signal 63. As illustrated, the truth clock 
leads the quadrature clock by 90 degrees. It is also possible for 
the truth clock to lag the quadrature clock by 90 degrees. In 

25 other embodiments other, even non-constant, phase differences are 
used. 

Returning now to Fig. 2, both the truth clock signal and the 
quadrature clock signal are provided to a clock distribution 
network 27. The clock distribution network provides the master 

30 clock signals to multiple CRUs 2 9a-d on the chip. The phase 
relationship between the master clock signals is maintained over 
the clock distribution network. In order to maintain the phase 
difference between the master clock signals, the clock 
distribution network uses equal length or very nearly equal 

35 length transmission lines for each clock signal, and high speed 
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clock buffers to maintain signal strength. Other clock 
distribution techniques well known in the art are also employed 

5 to help maintain a constant phase difference between the truth 
and quadrature clocks. Each CRU receives both the truth clock 
signal and the quadrature clock signal in addition to a serial 
data stream 26a-d. The recovered clock 28a-d is used to recover 
data from an incoming serial data stream. 

IQ As can be seen from Fig. 3, each CRU 3 0 includes a phase 

detector 31, a mixer 33 and a filter/accumulator 35. Also as can 
be seen, each CRU additionally receives an incoming serial data 
signal 37 and the master clock signals 39a, b. The mixer receives 
the master clock signals, the output from the f ilter/accumulator 

15 and produces the recovered clock. The phase detector receives the 
serial data signal and the recovered clock and provides phase 
data to the f ilter/accumulator • 

The phase detector receives both the incoming serial data 
signal and the recovered clock signal generated by the mixer. The 

2 0 phase detector compares the relative phases of the recovered 

clock and the incoming serial data signal. The phase detector 
outputs a signal 43 containing the phase difference information 
to the filter/accumulator. The filter/accumulator uses the phase 
difference information to instruct the mixer to adjust the phase 

25 of the recovered clock accordingly. 

For example, if the phase of the incoming data stream leads 
the phase of the recovered clock signal, the phase detector 
detects this phase difference. The phase detector sends a signal 
containing the phase difference information to the 

30 filter/accumulator which stores this information. If the clock 
continues to lag the data for a specified amount of time, the 
filter/accumulator sends a signal 45 to the mixer instructing it 
to choose the next phase of the recovered clock signal. The phase 
adjustment mechanism thus realigns the phase of the incoming data 

35 and the recovered clock. 
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Providing a signal instructing the mixer to adjust phase 
after the phase difference between the data and the recovered 

5 clock has remained for a specified time helps ensure that 
momentary or transient changes in phase will not require a change 
in the phase of the regenerated clock. A similar action occurs 
if the phase of the incoming data stream lags the phase of the 
recovered clock signal. Again, this mechanism realigns the phases 

10 of the incoming data and the recovered clock. Realignment helps 
to ensure that the incoming data signal is sampled at the correct 
time . 

Fig. 4 illustrates an embodiment of a mixer of the CRU of 
Fig. 3, The illustrated embodiment uses two levels of mixing to 
15 create a regenerated clock signal. In the first stage of mixing, 
master clock signals are mixed to generate a number of 
intermediate clocks. In the second stage of mixing the 
intermediate clock signals are mixed to form the regenerated 
clock. 

2 0 To generate the intermediate clock signals, master clock 

signals 47a, b are each provided to a plurality of amplitude 
adjusters 48a-481. In one embodiment the amplitude adjusters 
include an amplifier circuit. In another embodiment, the 
amplitude adjusters comprise an attenuator. The attenuators 
25 operate in parallel. That is, in one embodiment, the clock 
signals do not pass through multiple, cascaded attenuators. In 
addition, in one embodiment the attenuators may also invert the 
truth and quadrature clock signals. Amplitude attenuation can be 
accomplished through the use of resistors, by biasing transistors 

3 0 to operate in their linear range, or through the use of diodes 

or voltage regulators or by other methods well known in the art. 

In Fig. 4 a negative value in an attenuator, such as in 
attenuator 48k, indicates that the clock signal is inverted by 
the attenuator. The inversion is in addition to any amplitude 
35 adjustment that may be performed. Inverting elements such as CMOS 
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inverters, inverting operational amplifier based circuits, or 
other well-known methods may be used to invert the clock signals 

5 received by the attenuators . 

The outputs 49a-l of the attenuators are provided to N 
summers or mixers 51a-f . In one embodiment, the summers operate 
in parallel. That is, each summer has as its inputs various 
outputs of the attenuators. The summers may be implemented using 

10 operational amplifier based circuits, digital signal processing, 
or by other well known methods. 

In one embodiment, each summer receives two signals, one 
each from two attenuators. The first signal received by the 
summer is an amplitude adjusted truth clock signal. The second 

15 signal received by the summer is an amplitude adjusted quadrature 
clock signal. The amplitude adjusted signals received by each 
summer are summed by the respective summers. The output of each 
of the summers forms an intermediate clock signal. 

The N summers produce N intermediate clock signals 53a- f, 

20 with each summer producing an intermediate clock signal. The 
intermediate clocks have the same frequency, but differing 
phases. Each different phase is generated by summing different 
amounts of the truth and quadrature clocks. The attenuators 
allow different amounts of the truth clock and quadrature clock 

25 to be provided to the summers. Thus, different amounts of the 
truth and quadrature clocks are summed. The number of 
intermediate clocks used is determined by the desired resolution 
of the CRU. 

The summers may be viewed as being arranged in an order, 
30 from first to last. Using this construct, a first summer produces 
an intermediate clock signal with a first phase. Each summer 
produces an intermediate clock signal with a different phase. 
The last summer produces an intermediate clock signal with a last 
phase . 

35 
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As illustrated, the first summer 51a sums all of the truth 
clock, and none of the quadrature clock. The second summer 51b 

5 sums a large amount of the truth clock and a small amount of the 
quadrature clock. Subsequent summers in the order receive smaller 
amounts of the truth clock and larger amounts of the quardature 
clock. The penultimate summer 51e sums a small amount of the 
truth clock, and a large amount of the quadrature clock. The 

10 final summer 51f sums none of the truth clock and all of the 
quadrature clock. 

As an example, consider the case where N = 5. The first 
summer receives a truth clock signal weighted by 1, and a 
quadrature clock signal weighted by 0 and sums them. The second 

15 summer receives a truth clock signal weighted by .75, and a 
quadrature clock signal weighted by .25 and sums them. The third 
summer receives a truth clock signal weighted by .5, and a 
quadrature clock signal weighted by .5 and sums them. The fourth 
summer receives a truth clock signal weighted by .25, and a 

20 quadrature clock signal weighted by .75 and sums them. The final 
summer receives a truth clock signal weighted by 0, and a 
quadrature clock signal weighted by 1 and sums them. 

There are 5 outputs, one from each of the summers, each 
output forming an intermediate clock signal. The amount of truth 

2 5 and quadrature clock signals that are used varies for each 

intermediate clock signal generated. Since the truth and 
quadrature clock signals have different phases, each intermediate 
clock signal has a unique phase. 

In the embodiment described in the above example, the sum 
30 of the amount of truth clock signal used and the amount of 
quadrature clock signal used is equal to one. Additionally, in 
the example above, the difference in the amount of a particular 
clock signal summed from one summer to the next was a constant. 
In other embodiments the difference in the amount of a particular 

35 
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clock signal summed from one summer to the next is not a 
constant . 

5 For a 90 degree phase offset between the truth clock and the 

quadrature clock, different equations describe the generation of 
each 90 degrees of the intermediate clock signals. 

An equation describing the generation of an arbitrary 
intermediate clock phase, a, greater than 0"" and less than or 

10 equal to 90° is 

Pa=(iN-a)* Ptruth + (a) * Pquad 

Where Ptruth is the truth clock signal, and P^^^ is the quadrature 
clock signal. Also, a is an integer in the range from 0 to N, 
inclusive. The attenuators allow a to be varied, as described 
15 above. 

The attenuation of each clock signal that is input to the 
summer is determined by the variable a. As a varies from 0 to N, 
the amount of truth clock signal that is summed varies from 100% 
at a equals zero to 0% at a equals N. Thus, fractional amounts 

2 0 of the truth clock signal and the quadrature clock signal are 

summed to generate an intermediate clock signal. 

Discussing the embodiment of Fig. 4 with respect to the 
above, the first mixer stage provides outputs for each value of 
N. In the preferred embodiment, the truth clock signal and the 

25 quadrature clock signal have 90 degrees of phase offset. 
Consequently, the phases of the intermediate clock signals 
corresponding to a equals 0 to N are evenly distributed in the 
range of the clock cycle from greater than 0° and less than or 
equal to 90°. For intermediate clock phases other than in the 

30 range from greater than 0° and less than or equal to 90°, 
different equations describe the generation of the intermediate 
clock signals. 

To generate intermediate clock signals greater than 90° to 
less than or equal to 180° the same mixing mechanism is used, 
35 with the input sources being the quadrature clock signal and the 
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10 



complement of the truth clock signal; an equation describing this 
relationship is 

In this case a is an integer in the range from 0 to -N, 
inclusive. A negative value indicates the complement of that 
particular input. The attenuators allow a to be varied, as 
described above . 

The attenuation of each clock signal that is input to the 
summer is determined by the variable a. As a varies from 0 to -N, 
the amount of the complement of the truth clock signal that is 
summed varies from 100% at a equals zero to 0% at a equals -N. 
Thus, fractional amounts of the truth clock signal and the 
15 quadrature clock signal are summed. 

The first mixer stage provides outputs for each value of N. 
These outputs are the intermediate clock signals that can be seen 
in Fig. 4. In one embodiment, the truth clock signal and the 
quadrature clock signal have 90 degrees of phase offset. 
20 Consequently, the phases of the intermediate clock signals 
corresponding to a equals 0 to -N are evenly distributed in the 
range from greater than 90"" to less than or equal to 180 of the 
clock cycle. 

Thus, the first stage of the mixer generates intermediate 
25 clock phases between 0 and 180 degrees, as described above. Since 
the clock signals are odd functions in the range from 0° to 
360"", the intermediate clock phases greater than 180° degrees and 
less than or equal to 360° are the complement of the clock phases 
between 0"" and 180°. Thus, intermediate clock signals with phases 
greater than 180° and less than or equal to 270° are generated 
according to 



30 



Pa= (N- a)"^ Prru.h + (a) * Pguad 

Similarly, intermediate clock signals with phases greater 
than 27 0° and less than or equal to 360° are generated according 
35 to 
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Pa= (N-\- a)"^ Ptruth + {a) * Pquad 

Returning to Fig. 4, in the second stage of the mixer, the 

5 intermediate clock signals are themselves mixed in order to 
generate the recovered clock. Multiplexors 55a-f in the second 
stage of the mixer receive both true and complementary versions 
of the intermediate clock signals generated by the previous stage 
of the mixer. There are 4N multiplexors, corresponding to the 4N 

10 intermediate clock signals. In one embodiment, each multiplexor 
is a two input multiplexor. The multiplexor receives at its 
inputs a particular intermediate clock signal and the complement 
of that signal. A select signal 54a-f for each multiplexor is 
generated by the filter/accumulator. The select signal causes the 

15 multiplexor to output either the true or complementary version 
of the intermediate clock signal. By varying the select signals 
that the multiplexors receive, the filter/accumulator alters 
which intermediate clock signals are received by the summer 57 
in the second stage of the mixer. 

20 The output signals from the multiplexors 59a-f are provided 

to the summer. The summer sums the 4N signals received from the 
multiplexors to form the regenerated clock signal 61. Altering 
the intermediate clocks summed in the second stage of mixing 
varies the phase of the recovered clock. Thus, by varying the 

25 select signals that the multiplexors receive, the 
filter/accumulator can change the phase of the regenerated clock. 
The regenerated clock signal can then be used to recover the 
incoming serial data. 

The present invention therefore provides a scalable clock 

30 recovery system able to be implemented on a single chip. Although 
the invention is described in certain specific embodiments, it 
should be recognized that the invention may be practiced other 
than as specifically described. Thus, the invention should be 
considered to be the claims and their equivalents supported by 

35 this disclosure. 
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